Cloned DNA sequences related to the genomic RNA of
lymphadenopathy-associated virus (LAV) and proteins encoded by
said LAV genomic RNA

The invention relates to cloned DNA sequences
indistinguishable from genomic RNA and DNA of
lymphadenopathy-associated virus (LAV), a process for their
preparation and their uses. It relates more particularly
to stable probes including a DNA sequence which can be
used for the detection of the LAV virus or related viruses
or DNA proviruses in any medium, particularly biological
samples containing any of them. The invention also relates
to polypeptides, whether glycosylated or not, encoded by
said DNA sequences.

Lymphadenopathy-associated virus (LAV) is a human
retrovirus first isolated from the lymph node of a
homosexual patient with lymphadenopathy syndrome, frequently a
prodrome or a benign form of acquired immune deficiency
syndrome (AIDS). Subsequently other LAV isolates have been
recovered from patients with AIDS or pre-AIDS. All available
data are consistent with the virus being the causative
agent of AIDS.

A method for cloning such DNA sequences has already
been disclosed in British Patent Application Nr. 84 23659
filed on September 19, 1984. Reference is hereafter made to
that application as concerns subject matter in common with
the further improvements to the invention disclosed herein.

The present invention aims at providing additional
new means which should not only be useful for the
detection of LAV or related viruses (hereafter more
generally referred to as "LAV viruses"), but also have
more versatility, particularly in detecting specific parts
of the genomic DNA of said viruses whose expression products
are not always directly detectable by immunological
methods.

The present invention further aims at providing
polypeptides containing sequences in common with polypeptides
encoded by the LAV genomic RNA. It relates even more
particularly to polypeptides comprising antigenic determinants
included in the proteins encoded and expressed by
the LAV genome occurring in nature. An additional object of
the invention is to further provide means for the
detection of proteins related to LAV virus, particularly
for the diagnosis of AIDS or pre-AIDS or, to the contrary,
for the detection of antibodies against the LAV virus or
proteins related therewith, particularly in patients
afflicted with AIDS or pre-AIDS or more generally in
asymptomatic carriers and in blood-related products.
Finally the invention also aims at providing immunogenic
polypeptides, and more particularly protective
polypeptides for use in the preparation of vaccine
compositions against AIDS or related syndromes.

The present invention relates to additional DNA
fragments, hybridizable with the genomic RNA of LAV as
they will be disclosed hereafter, as well as with additional
cDNA variants corresponding to the whole genomes of
LAV viruses. It further relates to DNA recombinants
containing said DNAs or cDNA fragments.

The invention relates more particularly to a cDNA
variant corresponding to the whole of LAV retroviral
genomes, which is characterized by a series of restriction
sites in the order hereafter (from the 5' end to the 3'
end).


The coordinates of the successive sites of the
whole LAV genome (restriction map) are indicated hereafter
too, with respect to the Hind III site (selected as of
coordinate 1) which is located in the R region. The
coordinates are estimated with an accuracy of ± 200 bp:


Hind III          0
Sac I            50
Hind III        520
Pst I           800
Hind III      1 100
Bgl II        1 500
Kpn I         3 500
Kpn I         3 900
Eco RI        4 100
Eco RI        5 300
Sal I         5 500
Kpn I         6 100
Bgl II        6 500
Bgl II        ? 800
Hind III      7 850
Bam HI        8 150
Xho I         8 600
Kpn I         8 700
Bgl II        8 750
Bgl II        9 150
Sac I         9 200
Hind III      9 250
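
As an illustration of how such a map can be used, the following Python sketch computes the sizes of the fragments expected from a complete Hind III digestion. The coordinate list is transcribed from the map, reading the second Hind III site as 520, which is an assumption about a poorly legible figure. The function name `fragment_sizes` is hypothetical. Since the coordinates are only estimated to within about 200 bp, the computed sizes are approximate.

```python
def fragment_sizes(cut_sites):
    """Return the distances (in bp) between consecutive cut sites."""
    ordered = sorted(cut_sites)
    return [b - a for a, b in zip(ordered, ordered[1:])]


# Hind III coordinates as read from the restriction map
# (the value 520 is an assumed reading of the scan).
HIND_III_SITES = [0, 520, 1100, 7850, 9250]

if __name__ == "__main__":
    for size in fragment_sizes(HIND_III_SITES):
        print(f"{size} bp")
```

The sketch treats the map as linear between the first and the last Hind III site; it does not model partial digestion.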

Another DNA variant according to this invention
optionally contains an additional Hind III approximately
at the 5 550 coordinate.

Reference is further made to fig. 1 which shows a
more detailed restriction map of said whole DNA (λJ19).

An even more detailed nucleotidic sequence of a
preferred DNA according to the invention is shown in figs.
4-12 hereafter.

The invention further relates to other preferred
DNA fragments which will be referred to hereafter.

Additional features of the invention will appear
in the course of the non-limitative disclosure of additional
features of preferred DNAs of the invention, as well
as of preferred polypeptides according to the invention.
Reference will further be had to the drawings in which:

- fig. 1 is the restriction map of a complete LAV genome
(clone λJ19);

- figs. 2 and 3 show diagrammatically parts of the three
possible reading phases of LAV genomic RNA, including the
open reading frames (ORF) apparent in each of said reading
phases;

- figs. 4-12 show the successive nucleotidic sequences of
a complete LAV genome. The possible peptidic sequences in
relation to the three possible reading phases related to
the nucleotidic sequences shown are also indicated;

- figs. 13-18 reiterate the sequence of part of the LAV
genome containing the genes coding for the envelope
proteins, with particular boxed peptidic sequences which
correspond to groups which normally carry glycosyl groups.

The sequencing and determination of sites of particular
interest was carried out on a phage recombinant
corresponding to λJ19 disclosed in the abovesaid British
Patent application Nr. 84 23659. A method for preparing it
is disclosed in that application.

The whole recombinant phage DNA of clone λJ19
(disclosed in the earlier application) was sonicated
according to the protocol of DEININGER (1983), Analytical
Biochem. 129, 216. The DNA was repaired by a Klenow
reaction for 12 hours at 18°C. The DNA was electrophoresed
through 0.8 % agarose gel and DNA in the size range of
300-600 bp was cut out and electroeluted and precipitated.
Resuspended DNA (in 10 mM Tris, pH 8; 0.1 mM EDTA) was
ligated into M13mp8 RF DNA (cut by the restriction enzyme
SmaI and subsequently treated with alkaline phosphatase),
using T4 DNA- and RNA-ligases (Maniatis T et al (1982) -
Molecular cloning - Cold Spring Harbor Laboratory). An
E. coli strain designated as TG1 was used for further study.
This strain has the following genotype:

Δlac pro, supE, thi, F'traD36, proAB, lacIq, ZΔM15, r-

This E. coli TG1 strain has the peculiarity of
enabling recombinants to be recognized easily. The blue
colour of the cells transfected with plasmids which did
not recombine with a fragment of LAV DNA is not modified.
To the contrary cells transfected by a recombinant plasmid
containing a LAV DNA fragment yield white colonies. The
technique which was used is disclosed in Gene (1983), 26,
101.

This strain was transformed with the ligation mix
using the Hanahan method (Hanahan D (1983) J. Mol. Biol.
166, 557). Cells were plated out on tryptone-agarose plate
with IPTG and X-gal in soft agarose. White plaques were
either picked and screened or screened directly in situ
using nitrocellulose filters. Their DNAs were hybridized
with nick-translated DNA inserts of pUC18 Hind III
subclones of λJ19. This permitted the isolation of the
plasmids or subclones of λ which are identified in the
table hereafter. In relation to this table it should also
be noted that the designation of each plasmid is followed
by the deposition number of a cell culture of E. coli TG1
containing the corresponding plasmid at the "Collection
Nationale des Cultures de Micro-organismes" (C.N.C.M.) of
the Pasteur Institute in Paris, France. A non-transformed
TG1 cell line was also deposited at the C.N.C.M. under Nr.
I-364. All these deposits took place on November 15, 1984.
The sizes of the corresponding inserts derived from the
LAV genome have also been indicated.

TABLE

Essential features of the recombinant plasmids

pJ19-1 plasmid (I-365)     0.5 kb

    Hind III - Sac I - Hind III

pJ19-17 plasmid (I-367)    0.5 kb

    Hind III - Pst I - Hind III

pJ19-6 plasmid (I-366)     1.5 kb

    Hind III (5')
    Bam HI
    Xho I
    Kpn I
    Bgl II
    Sac I (3')
    Hind III

pJ19-13 plasmid (I-368)    6.7 kb

    Hind III (5')
    Bgl II
    Kpn I
    Kpn I
    Eco RI
    Eco RI
    Sal I
    Kpn I
    Bgl II
    Bgl II
    Hind III (3')

Positively hybridizing M13 phage plates were grown
up for 5 hours and the single-stranded DNAs were
extracted.

M13mp8 subclones of λJ19 DNAs were sequenced
according to the dideoxy method and technology devised by
Sanger et al (Sanger et al (1977), Proc. Natl. Acad. Sci.
USA, 74, 5463 and M13 cloning and sequencing handbook,
AMERSHAM (1983)). The 17-mer oligonucleotide primer,
α-35S dATP (400 Ci/mmol, AMERSHAM), and 0.5X-5X buffer
gradient gels (Biggin M.D. et al (1983), Proc. Natl. Acad.
Sci. USA, 80, 3963) were used. Gels were read and put into
the computer under the programs of Staden (Staden R.
(1982), Nucl. Acids Res. 10, 4731). All the appropriate
references and methods can be found in the AMERSHAM M13
cloning and sequencing handbook.

The complete sequence of λJ19 was deduced from the
experiments as further disclosed hereafter.

Figs. 4-12 provide the DNA nucleotidic sequence of
the complete genome of LAV. The numbering of the
nucleotides starts from a left most Hind III restriction
site (5' AAG...) of the restriction map. The numbering
occurs in tens whereby the last zero number of each of the
numbers occurring on the drawings is located just below the
nucleotide corresponding to the nucleotides designated,
i.e. the nucleotide at position 10 is T, the nucleotide at
position 20 is C, etc.

Above each of the lines of the successive nucleotidic
sequences there are provided three lines of single
letters corresponding to the aminoacid sequence deduced
from the DNA sequence (using the genetic code) for each of
the three reading phases, whereby said single letters have
the following meanings.

A : alanine
R : arginine
K : lysine
H : histidine
C : cysteine
M : methionine
W : tryptophan
F : phenylalanine
Y : tyrosine
L : leucine
V : valine
I : isoleucine
G : glycine
T : threonine
S : serine
E : glutamic acid
D : aspartic acid
N : asparagine
Q : glutamine
P : proline.

The asterisk signs "*" correspond to stop codons
(i.e. TAA, TAG and TGA).
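
The way the three lines of single letters are derived from a nucleotidic sequence can be sketched in Python as follows. This is only an illustration using the standard genetic code; the function names `translate` and `three_phases` are hypothetical, and the short sequence in the usage line is an invented example, not part of the LAV sequence. Phase 1 is assumed to start at the first nucleotide of the string supplied.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, phase):
    """Translate dna in reading phase 1, 2 or 3; '*' marks a stop codon."""
    dna = dna.upper()
    start = phase - 1
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(start, len(dna) - 2, 3)
    )


def three_phases(dna):
    """Return the deduced aminoacid sequence for each reading phase."""
    return {phase: translate(dna, phase) for phase in (1, 2, 3)}


if __name__ == "__main__":
    # Invented 9-nucleotide example.
    print(three_phases("ATGGCCTAA"))
```

The sketch raises a `KeyError` on any character other than A, C, G or T, so ambiguous bases would need separate handling.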

Starting above the first line of the DNA
nucleotidic sequence of fig. 4 the three reading phases
are respectively marked "1", "2", "3", on the left
handside of the drawing. The same relative presentation of
the three theoretical reading phases is then used all over
the successive lines of the LAV nucleotidic sequence.

Figs. 2 and 3 provide a diagrammatized representation
of the lengths of the successive open reading
frames corresponding to the successive reading phases
(also referred to by numbers "1", "2" and "3" appearing in
the left handside part of fig. 2). The relative positions
of these open reading frames (ORF) with respect to the
nucleotidic structure of the LAV genome is referred to by
the scale of numbers representative of the respective
positions of the corresponding nucleotides in the DNA
sequence. The vertical bars correspond to the positions of
the corresponding stop codons.
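
A diagram of this kind can be approximated by listing, for each reading phase, the stretches lying between consecutive stop codons. The Python sketch below does this under the assumption that an open reading frame is simply a run of codons uninterrupted by TAA, TAG or TGA; how the figures themselves were drawn is not specified here. The function name `open_frames` and the `min_codons` threshold are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def open_frames(dna, min_codons=1):
    """List (phase, first_nt, last_nt) for stop-free runs, 1-based inclusive."""
    dna = dna.upper()
    found = []
    for offset in range(3):
        run_start = offset
        for i in range(offset, len(dna) - 2, 3):
            if dna[i:i + 3] in STOP_CODONS:
                if (i - run_start) // 3 >= min_codons:
                    found.append((offset + 1, run_start + 1, i))
                run_start = i + 3
        # Run that reaches the end of the sequence without meeting a stop.
        run_end = offset + ((len(dna) - offset) // 3) * 3
        if (run_end - run_start) // 3 >= min_codons:
            found.append((offset + 1, run_start + 1, run_end))
    return found


if __name__ == "__main__":
    # Invented 15-nucleotide example.
    for frame in open_frames("ATGAAATAAGGGCCC", min_codons=2):
        print(frame)
```

Raising `min_codons` keeps only the longer frames, which is closer to what a diagram at genome scale would show.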

1) The "gag gene" (or ORF-gag)

The "gag gene" codes for core proteins.
Particularly it appears that a genomic fragment (ORF-gag)
thought to code for the core antigens including the p25,
p18 and p13 proteins is located between nucleotidic
position 236 (starting with 5' CTA GCG GAG 3') and
nucleotidic position 1759 (ending by CTCG TCA CAA 3'). The
structure of the peptides or proteins encoded by parts of
said ORF is deemed to be that corresponding to phase 2.

The methionine aminoacid "M" coded by the ATG at
position 260-262 is the probable initiation methionine of
the gag protein precursor. The end of ORF-gag and
accordingly of gag protein appears to be located at
position 1759.


The beginning of p25 protein, thought to start by
a P-I-V-Q-N-I-Q-G-Q-M-V-H aminoacid sequence, is
thought to be coded for by the nucleotidic sequence
CCTATA..., starting at position 656.

Hydrophilic peptides in the gag open reading frame
are identified hereafter. They are defined starting from
aminoacid 1 = Met (M) coded by the ATG starting from 260-2
in the LAV DNA sequence.

Those hydrophilic peptides are
12-32 aminoacids inclusive
37-46
49-79
88-153
158-165
178-188
200-220
226-234
239-264
266-331
352-361
377-390
399-432
437-484
492-498

The invention also relates to any combination of
these peptides.
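
Since these peptides are defined by aminoacid rank counted from the Met coded at nucleotide 260, their nucleotide coordinates follow by simple arithmetic. The Python sketch below shows the conversion; the function name `peptide_to_nucleotides` is hypothetical and the calculation assumes an uninterrupted reading frame from the first codon.

```python
def peptide_to_nucleotides(first_codon_start, aa_first, aa_last):
    """Map an inclusive aminoacid range onto inclusive nucleotide positions.

    first_codon_start is the position of the first nucleotide of the codon
    counted as aminoacid 1.
    """
    if aa_first < 1 or aa_last < aa_first:
        raise ValueError("expected 1 <= aa_first <= aa_last")
    nt_start = first_codon_start + 3 * (aa_first - 1)
    nt_end = first_codon_start + 3 * aa_last - 1
    return nt_start, nt_end


if __name__ == "__main__":
    # First hydrophilic gag peptide, aminoacids 12-32, Met codon at 260.
    print(peptide_to_nucleotides(260, 12, 32))
```

For the peptide 12-32 this gives nucleotides 293 to 355, a span of 63 nucleotides for 21 aminoacids.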


Figs. 4-12 also show that the DNA fragment
extending from nucleotidic position 1555 (starting with
5'TTT TTT ....3') to nucleotidic position 5066 is thought
to correspond to the pol gene. The polypeptidic structure
of the corresponding polypeptides is deemed to be that
corresponding to phase 1. It stops at position 4563 (end
by 5'G GAT GAG GAT 3').

These genes are thought to code for the virus
polymerase or reverse transcriptase.


The DNA sequence thought to code for envelope
proteins is thought to extend from nucleotidic position
5670 (starting with 5'AAA GAG GAG A....3') up to nucleotidic
position 8132 (ending by ....A ACT AAA GAA 3').
Polypeptidic structures of sequences of the envelope
protein correspond to those read according to the "phase
3" reading phase.
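
The reading phase of a fragment can be checked from its start coordinate. The Python sketch below assumes that phase 1 is the frame whose first codon begins at nucleotide 1 of the numbering; this convention is an assumption, as it is not spelled out in the text, and the function name `reading_phase` is hypothetical.

```python
def reading_phase(position):
    """Return 1, 2 or 3 for a 1-based nucleotide position.

    Assumes phase 1 starts at nucleotide 1 (convention assumed here).
    """
    if position < 1:
        raise ValueError("positions are 1-based")
    return (position - 1) % 3 + 1


if __name__ == "__main__":
    # Start positions of the three fragments as given in claims 1 to 3.
    for name, start in (("gag", 236), ("pol", 1555), ("env", 5670)):
        print(name, start, "phase", reading_phase(start))
```

Under that assumption, the start positions given in claims 1 to 3 (236, 1555 and 5670) fall in phases 2, 1 and 3 respectively, in agreement with the phases stated for gag, pol and env.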

The start of env transcription is thought to be at
the level of the ATG codon at positions 5691-5693.

Additional features of the envelope protein coded
by the env genes appear on figs. 13-18. These are to be
considered as paired figs. 13 and 14; 15 and 16; 17 and
18 respectively.

It is to be mentioned that because of format
difficulties:

Fig. 14 overlaps to some extent with fig. 13.

Fig. 16 overlaps to some extent with fig. 15.

Fig. 18 overlaps to some extent with fig. 17.

Thus for instance figs. 13 and 14 must be considered
together. Particularly the sequence shown on the
first line on the top of fig. 13 overlaps with the
sequence shown on the first line on the top of fig. 14. In
other words the starting of the reading of the successive
sequences of the env gene as represented in figs. 13-18
involves first reading the first line at the top of fig.
13 then proceeding further with the first line of fig. 14.
One then returns to the beginning of the second line of
fig. 13, then again further proceeds with the reading of
the second line of fig. 14, etc. The same observations
then apply to the reading of the paired figs. 15 and 16,
and paired figs. 17 and 18, respectively.
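
The reading order described for the paired figures can be sketched in Python as follows, for a reader who has transcribed each figure as a list of text lines. The function names are hypothetical and the input lines are invented. The sketch assumes that the overlap between two halves of a line is the longest stretch that ends the left half and begins the right half; a coincidental repeat at the junction would be merged too, so the result should be checked against the drawings.

```python
def join_with_overlap(left, right):
    """Join two line halves, keeping a shared overlapping stretch only once."""
    for k in range(min(len(left), len(right)), 0, -1):
        if left.endswith(right[:k]):
            return left + right[k:]
    return left + right


def read_paired_figures(first_fig_lines, second_fig_lines):
    """Read line 1 of the first figure, then line 1 of the second, and so on."""
    return "".join(
        join_with_overlap(a, b)
        for a, b in zip(first_fig_lines, second_fig_lines)
    )


if __name__ == "__main__":
    # Invented example with a 3-letter overlap on each line.
    fig_13 = ["ATGGCC", "TTTAAA"]
    fig_14 = ["GCCTAA", "AAAGGG"]
    print(read_paired_figures(fig_13, fig_14))
```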

The locations of neutralizing epitopes are further
apparent in figs. 13-18. Reference is more particularly
made to the boxed groups of three letters included in the
aminoacid sequences of the envelope proteins (reading
phase 3) which can be designated generally by the formula
N-X-S or N-X-T, wherein X is any other possible aminoacid.
Thus the initial protein product of the env gene is a
glycoprotein of molecular weight in excess of 91,000. These
groups are deemed to generally carry glycosylated groups.
Those N-X-S and N-X-T groups with attached glycosylated
groups form together hydrophilic regions of the protein
and are deemed to be located at the periphery of and to be
exposed outwardly with respect to the normal conformation
of the proteins. Consequently they are considered as being
epitopes which can efficiently be brought into play in
vaccine compositions.
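
Groups answering the formula N-X-S or N-X-T can be located in a deduced aminoacid sequence with a short search. The Python sketch below follows the formula as stated, with X being any aminoacid letter; the function name `find_sequons` is hypothetical and the test sequence is invented, not taken from the env protein.

```python
import re

# N followed by any aminoacid letter and then S or T. The lookahead keeps
# the match one letter wide so that overlapping groups are all reported.
NXS_NXT = re.compile(r"N(?=([A-Z][ST]))")


def find_sequons(protein):
    """Return (1-based position of N, three-letter group) for each match."""
    return [
        (m.start() + 1, "N" + m.group(1))
        for m in NXS_NXT.finditer(protein.upper())
    ]


if __name__ == "__main__":
    # Invented 12-letter example.
    print(find_sequons("MNGSANATKNPT"))
```

Positions are counted from the first letter of the string supplied, so they need to be offset if the string does not start at aminoacid 1.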

The invention thus concerns with more particularity
peptide sequences included in the env-proteins and
excisable therefrom (or having the same aminoacid
structure), having sizes not exceeding 200 aminoacids.

Preferred peptides of this invention (referred to
hereafter as a, b, c, d, e, f) are deemed to correspond to
those encoded by the nucleotide sequences which extend
respectively between the following positions:

a) from about 6095 to about 6200
b) from about 6280 to about 6310
c) from about 6390 to about 6440
d) from about 6485 to about 6620
e) from about 6880 to about 6930
f) from about 7535 to about 7630

Other hydrophilic peptides in the env open reading
frame are identified hereafter. They are defined starting
from aminoacid 1 = lysine (K) coded by the AAA at position
5670-2 in the LAV DNA sequence.

These hydrophilic peptides are
8-23 aminoacids inclusive
63-78
82-90
97-123
127-183
197-201
239-294
300-327
334-381
397-424
466-500
510-523
551-577
594-603
621-630
657-679
719-758
780-803

The invention also relates to any combination of
these peptides.


The invention further concerns DNA sequences which
provide open reading frames defined as ORF-Q, ORF-R and as
"1", "2", "3", "4", "5", the relative position of which
appears more particularly in figs. 2 and 3.

These ORFs have the following locations:

ORF-Q    phase 1    start 4478    stop 5086
ORF-R    [phase and start illegible]    stop 9696
ORF-1    phase 1    start 5029    stop 5316
ORF-2    phase 2    start 5273    stop 5515
ORF-3    phase 1    start 5393    stop 5616
ORF-4    phase 2    start 5519    stop 5773
ORF-5    phase 1    start 7968    stop 8279

The LTR (long terminal repeats) can be defined as
lying between position 8590 and position 160 (end extending
over position 9097/1). As a matter of fact the end of
the genome is at 9097 and, because of the LTR structure of
the retrovirus, links up with the beginning of the
sequence:

       Hind III
CTCAATAAAGCTTGCCTTG
   9097 1

The invention concerns more particularly all the
DNA fragments which have been more specifically referred
to hereabove and which correspond to open reading frames.
It will be understood that the man skilled in the art will
be able to obtain them all, for instance by cleaving an
entire DNA corresponding to the complete genome of a LAV
species, such as by cleavage by a partial or complete
digestion thereof with a suitable restriction enzyme and
by the subsequent recovery of the relevant fragments. The
different DNAs disclosed in the earlier mentioned British
Application can be resorted to also as a source of suitable
fragments. The techniques disclosed hereabove for
the isolation of the fragments which were then included in
the plasmids referred to hereabove and which were then
used for the DNA sequencing can be used.

Of course other methods can be used. Some of them
have been exemplified in the earlier British Application.
Reference is for instance made to the following methods.

a) DNA can be transfected into mammalian cells
with appropriate selection markers by a variety of
techniques, calcium phosphate precipitation, polyethylene
glycol, protoplast-fusion, etc.

b) DNA fragments corresponding to genes can be
cloned into expression vectors for E. coli, yeast- or
mammalian cells and the resultant proteins purified.

c) The proviral DNA can be "shot-gunned" (fragmented)
into procaryotic expression vectors to generate
fusion polypeptides. Recombinants producing antigenically
competent fusion proteins can be identified by simply
screening the recombinants with antibodies against LAV
antigens.

The invention also relates more specifically to
cloned probes which can be made starting from any DNA
fragment according to this invention, thus to recombinant
DNAs containing such fragments, particularly any plasmids
amplifiable in procaryotic or eucaryotic cells and carrying
said fragments.

Using the cloned DNA fragments as a molecular
hybridization probe - either by marking with radionucleotides
or with fluorescent reagents - LAV virion RNA may be
detected directly in the blood, body fluids and blood
products (e.g. of the antihemophilic factors such as
Factor VIII concentrates) and vaccines, i.e. hepatitis B
vaccine. It has already been shown that whole virus can be
detected in culture supernatants of LAV producing cells. A
suitable method for achieving that detection comprises
immobilizing virus onto a support, e.g. nitrocellulose
filters, etc., disrupting the virion and hybridizing with
labelled (radiolabelled or "cold" fluorescent- or
enzyme-labelled) probes. Such an approach has already been
developed for Hepatitis B virus in peripheral blood
(according to SCOTTO J. et al. Hepatology (1983), 3,
379-384).

Probes according to the invention can also be used
for rapid screening of genomic DNA derived from the tissue
of patients with LAV related symptoms, to see if the
proviral DNA or RNA is present in host tissue and other
tissues.

A method which can be used for such screening
comprises the following steps: extraction of DNA from
tissue, restriction enzyme cleavage of said DNA,
electrophoresis of the fragments and Southern blotting of genomic
DNA from tissues, subsequent hybridization with labelled
cloned LAV proviral DNA. Hybridization in situ can also be
used.

Lymphatic fluids and tissues and other non-lymphatic
tissues of humans, primates and other mammalian
species can also be screened to see if other evolutionary
related retroviruses exist. The methods referred to
hereabove can be used, although hybridization and washings
would be done under non stringent conditions.

The DNA according to the invention can be used
also for achieving the expression of LAV viral antigens
for diagnostic purposes.

The invention also relates to the polypeptides
themselves which can be expressed by the different DNAs of
the invention, particularly by the ORFs or fragments
thereof, in appropriate hosts, particularly procaryotic or
eucaryotic hosts, after transformation thereof with a
suitable vector previously modified by the corresponding
DNAs.

These polypeptides can be used as diagnostic
tools, particularly for the detection of antibodies in
biological media, particularly in sera or tissues of
persons afflicted with pre-AIDS or AIDS, or simply
carrying antibodies in the absence of any apparent
disorders. Conversely the different peptides according to
this invention can be used themselves for the production
of antibodies, preferably monoclonal antibodies specific
of the different peptides respectively. For the production
of hybridomas secreting said monoclonal antibodies
conventional production and screening methods are used.
These monoclonal antibodies, which themselves are part of
the invention, then provide very useful tools for the
identification and even determination of relative
proportions of the different polypeptides or proteins in
biological samples, particularly human samples containing
LAV or related viruses.

Thus all of the above peptides can be used in 
diagnostics as sources of immunogens or antigens free of 
viral particles, produced using non-permissive systems, 
and thus of little or no biohazard risk. 

The invention further relates to the hosts (procaryotic
or eucaryotic cells) which are transformed by the
above mentioned recombinants and which are capable of
expressing said DNA fragments.

Finally it also relates to vaccine compositions
whose active principle is to be constituted by any of the
expressed antigens, i.e. whole antigens, fusion polypeptides
or oligopeptides in association with a suitable
pharmaceutical or physiologically acceptable carrier.

Preferably the active principles to be considered
in that field consist of the peptides containing less than
250 aminoacid units, preferably less than 150 as deducible
for the complete genomes of LAV, and even more preferably
those peptides which contain one or more groups selected
from N-X-S and N-X-T as defined above. Preferred peptides
for use in the production of vaccinating principles are
peptides (a) to (f) as defined above. By way of example
having no limitative character, there may be mentioned
that suitable dosages of the vaccine compositions are
those which enable administration to the host,
particularly human host, ranging from 10 to 500 micrograms
per kg, for instance 50 to 100 micrograms per kg.

For the purpose of clarity figs. 19 to 26 are
added. Reference may be made thereto in case of difficulties
of reading blurred parts of figs. 4 to 12.

Needless to say that figs. 19-26 are merely a
reiteration of the whole DNA sequence of the LAV genome.

Finally the invention also concerns vectors for
the transformation of eucaryotic cells of human origin,
particularly lymphocytes, the polymerases of which are
capable of recognizing the LTRs of LAV. Particularly said
vectors are characterized by the presence of a LAV LTR
therein, said LTR being then active as a promoter enabling
the efficient transcription and translation, in a suitable
host as defined above, of a DNA insert coding for a
determined protein placed under its control.

Needless to say that the invention extends to all
variants of genomes and corresponding DNA fragments (ORFs)
having substantially equivalent properties, all of said
genomes belonging to retroviruses which can be considered
as equivalents of LAV.

CLAIMS

1. A DNA fragment of LAV extending from nucleotide
position 236 to nucleotide position 1759.

2. A DNA fragment of LAV extending from nucleotide
position 1555 to nucleotide position 5066.

3. A DNA fragment of LAV extending from nucleotide
position 5670 to nucleotide position 8132.

4. A vector containing a DNA fragment according to
any of claims 1 to 3.

5. Peptide corresponding to any of those encoded
by the nucleotide sequences which extend respectively
between the following positions:

a) from about 6095 to about 6200
b) from about 6260 to about 6310
c) from about 6390 to about 6440
d) from about 6485 to about 6620
e) from about 6880 to about 6930
f) from about 7535 to about 7630




6. Peptide characterized by a sequence of aminoacids
deducible from LAV DNA the terminal aminoacids of
which extend between the following positions with respect
to the lysine (position 1) coded by the AAA at position
5670-5672 in the LAV DNA:

8-23 aminoacids inclusive
63-78
82-90
97-123
127-183
197-201
239-294
300-327
334-381
397-424
466-500
510-523
551-577
594-603
621-630
657-679
719-758
780-803

or any combination of these peptides.

7. Peptide corresponding to the aminoacid
sequences deducible from LAV DNA and the terminal
aminoacids of which are positioned at the positions
hereafter counted from the Met at position 1 coded by the
ATG sequence at nucleotide positions 260-2:

12-32 aminoacids inclusive
37-46
49-79
88-153
158-165
178-188
200-220
226-234
239-264
288-331
352-361
377-390
399-432
437-484
492-498

and combination of said peptides.

8. Diagnostic means containing any of the DNA
fragments of any of claims 1 to 3.

9. Diagnostic means containing any of the peptides
of any of claims 4 to 6.

10. Vaccine compositions containing any of the
peptides according to any of claims 4 to 8 in association
with a pharmaceutical vehicle.

